Job Shop Scheduling


Learning to Dispatch for Job Shop Scheduling via Deep Reinforcement Learning

Neural Information Processing Systems

Priority dispatching rules (PDRs) are widely used for solving the real-world Job-shop Scheduling Problem (JSSP). However, designing effective PDRs is a tedious task that requires substantial specialized knowledge and often delivers limited performance. In this paper, we propose to learn PDRs automatically via an end-to-end deep reinforcement learning agent. We exploit the disjunctive graph representation of JSSP and propose a Graph Neural Network based scheme to embed the states encountered during solving. The resulting policy network is size-agnostic, effectively enabling generalization to large-scale instances. Experiments show that the agent can learn high-quality PDRs from scratch with elementary raw features, and that it performs strongly against the best existing PDRs. The learned policies also perform well on much larger instances that are unseen in training.
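As a toy illustration of what a learned dispatching rule does at decision time (the paper's actual GNN architecture and raw features are not reproduced here; the feature names and weights below are purely hypothetical):

```python
# Hypothetical sketch of a learned priority dispatching rule (PDR):
# at each step, score every eligible operation and dispatch the argmax.

def score(op, weights):
    """Linear stand-in for the paper's GNN embedding + policy head."""
    return (weights["proc_time"] * op["proc_time"]
            + weights["work_remaining"] * op["work_remaining"])

def dispatch(eligible_ops, weights):
    """Pick the highest-priority eligible operation."""
    return max(eligible_ops, key=lambda op: score(op, weights))

ops = [
    {"job": 0, "proc_time": 5, "work_remaining": 20},
    {"job": 1, "proc_time": 2, "work_remaining": 30},
]
# Weights favouring short operations on jobs with much work left
# (one rule a learning agent might discover).
w = {"proc_time": -0.1, "work_remaining": 1.0}
chosen = dispatch(ops, w)  # dispatches job 1's operation
```

Because the scoring function is applied per operation, a policy of this shape is size-agnostic by construction, which is the property the abstract highlights.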


Learning to Dispatch for Job Shop Scheduling via Deep Reinforcement Learning Cong Zhang 1, Wen Song

Neural Information Processing Systems

In the paper, we adopt the Proximal Policy Optimization (PPO) algorithm [36] to train our agent; here we provide the details as pseudo code in Algorithm 1. In this section, we also show how the baseline PDRs compute the priority index for the operations. We then present the complete results on Taillard's benchmark: Table S.1 reports the main results, and Table S.2 reports the generalization performance of our policies. In both tables, the "UB" column is the best known solution. Similar conclusions can be drawn from the results on the DMU benchmark, reported in Tables S.3 and S.4. We show training curves for all problems in Figure 1.
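The PPO objective referenced above can be sketched as follows. This is the standard clipped surrogate loss from the PPO algorithm, not the authors' exact implementation; the clipping parameter eps=0.2 is a common default assumed here:

```python
def ppo_clip_loss(ratio, advantage, eps=0.2):
    """Standard PPO clipped surrogate loss (to be minimized).
    ratio = pi_new(a|s) / pi_old(a|s) for the taken action."""
    unclipped = ratio * advantage
    clipped = max(min(ratio, 1.0 + eps), 1.0 - eps) * advantage
    return -min(unclipped, clipped)

# With a positive advantage, gains from pushing the ratio above 1+eps
# are clipped away, which keeps each policy update small.
loss = ppo_clip_loss(ratio=1.5, advantage=1.0)  # ratio clipped at 1.2, loss = -1.2
```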


DyRo-MCTS: A Robust Monte Carlo Tree Search Approach to Dynamic Job Shop Scheduling

Chen, Ruiqi, Mei, Yi, Zhang, Fangfang, Zhang, Mengjie

arXiv.org Artificial Intelligence

Dynamic job shop scheduling, a fundamental combinatorial optimisation problem in various industrial sectors, poses substantial challenges for effective scheduling due to frequent disruptions caused by the arrival of new jobs. State-of-the-art methods employ machine learning to learn scheduling policies offline, enabling rapid responses to dynamic events. However, these offline policies are often imperfect, necessitating the use of planning techniques such as Monte Carlo Tree Search (MCTS) to improve performance at online decision time. The unpredictability of new job arrivals complicates online planning, as decisions based on incomplete problem information are vulnerable to disturbances. To address this issue, we propose the Dynamic Robust MCTS (DyRo-MCTS) approach, which integrates action robustness estimation into MCTS. DyRo-MCTS guides the production environment toward states that not only yield good scheduling outcomes but are also easily adaptable to future job arrivals. Extensive experiments show that DyRo-MCTS significantly improves the performance of offline-learned policies with negligible additional online planning time. Moreover, DyRo-MCTS consistently outperforms vanilla MCTS across various scheduling scenarios. Further analysis reveals that its ability to make robust scheduling decisions leads to long-term, sustainable performance gains under disturbances.
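One way to picture folding action robustness into the tree search is a UCT-style selection score with an extra bonus term. The actual DyRo-MCTS estimator is not specified in this abstract, so the `robustness` input and the `beta` weight below are purely illustrative:

```python
import math

def uct_score(total_value, visits, parent_visits, robustness, c=1.4, beta=0.5):
    """UCT selection score with a hypothetical robustness bonus.
    `robustness` stands in for an action robustness estimate in [0, 1];
    how DyRo-MCTS actually computes it is not shown here."""
    if visits == 0:
        return float("inf")  # always try unvisited children first
    exploit = total_value / visits          # average scheduling outcome
    explore = c * math.sqrt(math.log(parent_visits) / visits)
    return exploit + explore + beta * robustness
```

Under this reading, two actions with equal average outcomes are tie-broken toward the one whose resulting state is easier to adapt when new jobs arrive.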


A Lagrangian Dual based approach

Neural Information Processing Systems

The Job Shop Scheduling (JSS) problem can be viewed as an integer optimization program with a linear objective function and linear, disjunctive constraints. The Lagrangian-based deep learning model does not necessarily produce feasible schedules directly; the model presented below is used to construct solutions that are integral and feasible with respect to the original problem constraints. The experimental setting, as defined by the training and test data, simulates a situation in which some component of a manufacturing system 'slows down', causing processing times to extend. The model training follows the selection of parameters presented in Table 3 (Epochs: 500; Batch Size: 16; Learning rate: [1…). Finally, Constraints (23) capture Kirchhoff's Current Law and Constraints (24) capture Ohm's Law.
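The disjunctive integer program mentioned above is commonly written as follows, with start times $s_i$ and processing times $p_i$; this is the textbook formulation and may differ in detail from the paper's exact model:

```latex
\begin{aligned}
\min \quad & C_{\max} \\
\text{s.t.} \quad & s_j \ge s_i + p_i && \text{if } i \text{ immediately precedes } j \text{ within a job},\\
& s_i + p_i \le s_j \ \lor \ s_j + p_j \le s_i && \text{for all } i \ne j \text{ sharing a machine},\\
& C_{\max} \ge s_i + p_i, \quad s_i \ge 0 && \text{for all operations } i.
\end{aligned}
```

The "or" in the machine constraints is what makes the program disjunctive: for each pair of operations on the same machine, exactly one ordering must be chosen.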


Review for NeurIPS paper: Learning to Dispatch for Job Shop Scheduling via Deep Reinforcement Learning

Neural Information Processing Systems

Additional Feedback: Global comments: I think the authors did not use the proper NeurIPS LaTeX template; line numbers are missing, as well as footnotes. The overall look and number of pages seem fine, though. Figures 1, 2 and 3 are of poor quality, as they appear visually pixelated. I strongly suggest that the authors use vector graphics for their images.


Review for NeurIPS paper: Learning to Dispatch for Job Shop Scheduling via Deep Reinforcement Learning

Neural Information Processing Systems

The reviewers all agree that the paper is above the acceptance threshold. There is novelty in how the paper applies learning to JSSP, and the results are promising. But as Reviewer 4 points out, the paper hasn't shown how the proposed approach trades off solution quality and running time, without which it is difficult to judge whether it is a significant advance over existing techniques. Adding such results will strengthen the paper considerably.



Beyond Training: Optimizing Reinforcement Learning Based Job Shop Scheduling Through Adaptive Action Sampling

de Puiseau, Constantin Waubert, Dörpelkus, Christian, Peters, Jannik, Tercan, Hasan, Meisen, Tobias

arXiv.org Artificial Intelligence

Learned construction heuristics for scheduling problems have become increasingly competitive with established solvers and heuristics in recent years. In particular, significant improvements have been observed in solution approaches using deep reinforcement learning (DRL). While much attention has been paid to the design of network architectures and training algorithms to achieve state-of-the-art results, little research has investigated the optimal use of trained DRL agents during inference. Our work is based on the hypothesis that, similar to search algorithms, the utilization of trained DRL agents should depend on the acceptable computational budget. We propose a simple yet effective parameterization, called $\delta$-sampling, that manipulates the trained action vector to bias agent behavior toward exploration or exploitation during solution construction. By following this approach, we can achieve more comprehensive coverage of the search space while still generating an acceptable number of solutions. In addition, we propose an algorithm for obtaining the optimal parameterization for a given number of solutions and any given trained agent. Experiments extending existing training protocols for job shop scheduling problems with our inference method validate our hypothesis and yield the expected improvements in the generated solutions.
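One plausible reading of such a parameterization is a temperature-style rescaling of the trained policy's action probabilities before sampling. The paper's exact $\delta$-sampling transformation may differ, so the function below is only a sketch:

```python
# Hypothetical sketch: bias a trained policy's action distribution
# toward exploitation (delta > 1) or exploration (0 < delta < 1)
# before sampling the next construction step.

def delta_sample_probs(action_probs, delta):
    """Raise each probability to the power delta and renormalize.
    delta > 1 sharpens toward greedy; delta < 1 flattens toward uniform."""
    scaled = [p ** delta for p in action_probs]
    total = sum(scaled)
    return [p / total for p in scaled]

probs = [0.7, 0.2, 0.1]            # trained action vector for one step
greedy = delta_sample_probs(probs, 4.0)    # nearly one-hot on action 0
explore = delta_sample_probs(probs, 0.25)  # much closer to uniform
```

Sampling many solutions at a small delta spreads the budget over the search space, while a large delta reproduces near-greedy construction, which matches the exploration/exploitation trade-off the abstract describes.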


Job Shop Scheduling Benchmark: Environments and Instances for Learning and Non-learning Methods

Reijnen, Robbert, van Straaten, Kjell, Bukhsh, Zaharah, Zhang, Yingqian

arXiv.org Artificial Intelligence

We introduce an open-source GitHub repository containing comprehensive benchmarks for a wide range of machine scheduling problems, including Job Shop Scheduling (JSP), Flow Shop Scheduling (FSP), Flexible Job Shop Scheduling (FJSP), FJSP with Assembly constraints (FAJSP), FJSP with Sequence-Dependent Setup Times (FJSP-SDST), and the online FJSP (with online job arrivals). Our primary goal is to provide a centralized hub for researchers, practitioners, and enthusiasts interested in tackling machine scheduling challenges.


Job Shop Scheduling via Deep Reinforcement Learning: a Sequence to Sequence approach

Bonetta, Giovanni, Zago, Davide, Cancelliere, Rossella, Grosso, Andrea

arXiv.org Artificial Intelligence

Job scheduling is a well-known combinatorial optimization problem with countless applications. Well-planned schedules bring many benefits in the context of automated systems: among others, they limit production costs and waste. Nevertheless, the NP-hardness of this problem makes it essential to use heuristics whose design is difficult, requires specialized knowledge, and often produces methods tailored to the specific task. This paper presents an original end-to-end Deep Reinforcement Learning approach to scheduling that automatically learns dispatching rules. Our technique is inspired by natural language encoder-decoder models for sequence processing and has never, to the best of our knowledge, been used for scheduling purposes. We applied and tested our method on several benchmark instances of the Job Shop Problem, but the technique is general enough to potentially tackle other optimal job scheduling tasks with minimal intervention. Results demonstrate that we outperform many classical approaches based on priority dispatching rules and show competitive results against state-of-the-art Deep Reinforcement Learning methods.